The ManArray( Embedded Processor Architecture
نویسندگان
چکیده
The BOPS ManArray architecture is presented as a coprocessor platform for the embedded processor domain, consisting of scalable design points. As an array processor, a single architecture definition and tool set supports multiple configurations of processing elements (PEs) from low end single PE to large arrays of PEs. The ManArray selectable parallelism architecture mixes control-oriented operations, practical VLIW, packed data, and distributed array processing in a cohesive independently selectable manner. In addition, scalable conditional execution and single cycle communications across a high connectivity low cost network are integrated in the architecture allowing another level of selectivity that enhances the application of the parallel resources to providing high performance algorithms. Coupled with the array coprocessor is a scalable DMA engine that runs in the background and provides programmer selectable data distribution patterns and a high bandwidth data streaming interface to system peripherals and global memory. This paper introduces the embedded scalable ManArray architecture and selectable parallelism. A number of benchmarks are presented demonstrating the capability of this new architecture. For example, a 2x2 ManArray can process a distributed 256 point complex FFT in 425 cycles and a 8x8 2D IDCT in 34 cycles that meets IEEE standards.
منابع مشابه
ManArray Processor Interconnection Network: An Introduction
The present paper introduces the new interconnection network of the BOPS ManArray family of available core products. To form a ManArray network, the processing elements are completely connected within clusters and communicate with members of only two other clusters thereby reducing signal fan-out and wiring density. With this simple network, single-step communications between a hypercube and it...
متن کاملMT mediaLibTM for Chip MultiThreaded (CMT) Processors
Innovation in processor design tends to happen in waves. The introduction of the first 4-bit/8-bit microprocessor designs (the 4004/8008) by Intel in 1971/1972 triggered a wave of competing designs from other semiconductor companies, including Motorola, Zilog, MOS Technology, TI, Rockwell, RCA, Fairchild, and others. The SPARC® architecture was part of a wave of RISC processor designs that appe...
متن کاملH.263 Video Encoder Implementation on Bops Manta Processor
An H.263 video encoder implementation without negotiable coding options on BOPS Manta DSP is presented. First, the ManArray architecture is described. Then, partitioning between the multiple processing elements (PEs) and implementation issues of the video encoder are discussed. Based on the results of this experiment, we discuss the suitability of this parallel implementation platform for video...
متن کاملHigh-performance FFT implementation on the BOPS ManArray parallel DSP
We present a high performance implementation of the FFT algorithm on the BOPS ManArray parallel DSP processor. The ManArray we consider for this application consists of an array controller and 2 to 4 fully interconnected processing elements. To expose the parallelism inherent to an FFT algorithm we use a factorization of the DFT matrix in Kronecker products, permutation and diagonal matrices. O...
متن کاملUltra-Low-Energy DSP Processor Design for Many-Core Parallel Applications
Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...
متن کامل